Adjudicator Agreement and System Rankings for Person Name Search

Authors

Mark Arehart

Chris Wolf

Keith J. Miller

Abstract

We analyzed system rankings for person name search algorithms using a data set for which several versions of ground truth were developed, each resolving adjudicator conflicts by a different method. Thirteen algorithms were ranked by F-score, using bootstrap resampling for significance testing, on a data set containing 70,000 romanized names from various cultures. We found some disagreement among the four adjudicators, with kappa ranging from 0.57 to 0.78. Truth sets based on a single adjudicator, or on the intersection or union of positive adjudications, produced sizeable variability in scoring sensitivity, and to a lesser degree in rank order, compared to the consensus truth set. However, results on truth sets constructed by randomly choosing an adjudicator for each item were highly consistent with the consensus. The implication is that an evaluation in which one adjudicator has judged each item is nearly as good as a more expensive and labor-intensive one in which multiple adjudicators have judged each item and conflicts are resolved through voting.
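The truth-set constructions named in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy judgments, the function names, the use of pairwise Cohen's kappa, and the tie-breaking rule in the vote (ties count as non-matches). The abstract does not say how the authors computed kappa or broke ties, so this is not their implementation.

```python
import random
from typing import Dict, List, Sequence

# Hypothetical toy data (not from the paper): four adjudicators label the
# same eight candidate matches as relevant (1) or not relevant (0).
JUDGMENTS: Dict[str, List[int]] = {
    "A": [1, 1, 0, 1, 0, 0, 1, 0],
    "B": [1, 0, 0, 1, 0, 1, 1, 0],
    "C": [1, 1, 0, 1, 0, 0, 0, 0],
    "D": [1, 1, 0, 0, 0, 0, 1, 0],
}


def cohen_kappa(a: Sequence[int], b: Sequence[int]) -> float:
    """Cohen's kappa for two binary label sequences of equal length."""
    n = len(a)
    observed = sum(x == y for x, y in zip(a, b)) / n
    pa, pb = sum(a) / n, sum(b) / n
    expected = pa * pb + (1 - pa) * (1 - pb)
    if expected == 1.0:  # both raters gave the same constant label
        return 1.0
    return (observed - expected) / (1 - expected)


def union_truth(judgments: Dict[str, List[int]]) -> List[int]:
    """An item is positive if any adjudicator marked it positive."""
    return [int(any(votes)) for votes in zip(*judgments.values())]


def intersection_truth(judgments: Dict[str, List[int]]) -> List[int]:
    """An item is positive only if every adjudicator marked it positive."""
    return [int(all(votes)) for votes in zip(*judgments.values())]


def consensus_truth(judgments: Dict[str, List[int]]) -> List[int]:
    """Majority vote; ties count as negative (an assumption of this sketch)."""
    return [int(2 * sum(votes) > len(votes)) for votes in zip(*judgments.values())]


def random_truth(judgments: Dict[str, List[int]], seed: int = 0) -> List[int]:
    """For each item, keep the label of one randomly chosen adjudicator."""
    rng = random.Random(seed)
    return [rng.choice(votes) for votes in zip(*judgments.values())]


def f1(truth: Sequence[int], predicted: Sequence[int]) -> float:
    """F-score (beta = 1) of binary predictions against a truth set."""
    tp = sum(1 for t, p in zip(truth, predicted) if t and p)
    fp = sum(1 for t, p in zip(truth, predicted) if not t and p)
    fn = sum(1 for t, p in zip(truth, predicted) if t and not p)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical output of one search system on the same eight items.
SYSTEM_OUTPUT = [1, 1, 0, 0, 0, 1, 1, 0]

for name, truth in [
    ("union", union_truth(JUDGMENTS)),
    ("intersection", intersection_truth(JUDGMENTS)),
    ("consensus", consensus_truth(JUDGMENTS)),
    ("random", random_truth(JUDGMENTS)),
]:
    print(name, round(f1(truth, SYSTEM_OUTPUT), 3))
```

Scoring the same system output against each truth set shows how the choice of conflict resolution can move the F-score. The bootstrap significance testing mentioned in the abstract is not shown here.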

Similar resources

Personalizing Local Search with Twitter

We propose a new ranking model for personalized local search. While local search verticals such as Google Local and Yahoo! Local incorporate physical proximity and public sentiment (reviews and ratings), their rankings reflect minimal personalization. We personalize local search by integrating Twitter social network structure and content analysis. Specifically, we infer sentiment for tweets by ...

Person Name Recognition Using Injection of Name-Candidate Words into Conditional Random Fields for the Arabic Language

Named entity recognition and extraction are very important tasks for discovering proper names, including persons, locations, dates, and times, inside electronic textual resources. An accurate named entity recognition system is an essential utility for resolving fundamental problems in question answering systems, summary extraction, information retrieval and extraction, machine translation, video interpr...

Impact of HIT Design on Crowdsourcing Relevance

In this paper we investigate the design and implementation of effective crowdsourcing tasks in the context of book search evaluation. We observe the impact of aspects of the Human Intelligence Task (HIT) design on the quality of relevance labels provided by the crowd. We assess the output in terms of label agreement with a gold standard data set and observe the effect of the crowdsourced releva...

Google's PageRank and beyond - the science of search engine rankings

Why doesn't your home page appear on the first page of search results, even when you query your own name? How do other web pages always appear at the top? What creates these powerful rankings? And how? The first book...

Being Omnipresent To Be Almighty: The Importance of the Global Web Evidence for Organizational Expert Finding

Modern expert finding algorithms are developed under the assumption that all possible expertise evidence for a person is concentrated in a company that currently employs the person. The evidence that can be acquired outside of an enterprise is traditionally unnoticed. At the same time, the Web is full of personal information which is sufficiently detailed to judge about a person’s skills and kn...

Publication date: 2008
